Authors: Fahiem Bacchus
Abstract
Markov decision processes (MDPs) are a very popular tool for decision-theoretic planning (DTP), partly because of the well-developed, expressive theory that includes effective solution techniques. But the Markov assumption (that dynamics and rewards depend on the current state only, and not on history) is often inappropriate. This is especially true of rewards: we frequently wish to associate rewards with behaviors that extend over time. Of course, such reward processes can be encoded in an MDP should we have a rich enough state space (where states encode enough history). However, it is often difficult to "hand-craft" suitable state spaces that encode an appropriate amount of history. We consider this problem in the case where non-Markovian rewards are encoded by assigning values to formulas of a temporal logic. These formulas characterize the value of temporally extended behaviors. We argue that this allows a natural representation of many commonly encountered non-Markovian rewards. The main result is an algorithm which, given a decision process with non-Markovian rewards expressed in this manner, automatically constructs an equivalent MDP (with Markovian reward structure), allowing optimal policy construction using standard techniques.
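The remark that a reward process can be encoded in an MDP once states carry enough history can be illustrated with a toy sketch. This is not the algorithm from the paper. It is a hand-built example under stated assumptions: the propositions `p` and `g`, the reward rule ("g holds now and p held at some earlier step"), and all function names are hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical toy domain: a base state is the set of propositions true in it.
BASE_STATES = [frozenset(), frozenset({"p"}), frozenset({"g"}), frozenset({"p", "g"})]

def history_reward(history):
    """Non-Markovian reward: 1.0 if g holds now and p held at a strictly earlier step."""
    *past, current = history
    return 1.0 if "g" in current and any("p" in s for s in past) else 0.0

def initial_augmented(base):
    """Augmented state = (base state, bit recording whether p held earlier)."""
    return (base, False)

def step_augmented(aug_state, next_base):
    """Update the history bit when the base process moves to next_base."""
    base, seen_p = aug_state
    return (next_base, seen_p or "p" in base)

def augmented_reward(aug_state):
    """Markovian reward: depends only on the current augmented state."""
    base, seen_p = aug_state
    return 1.0 if "g" in base and seen_p else 0.0

def check_equivalence(max_len=4):
    """Compare both reward definitions on every history up to max_len steps."""
    for n in range(1, max_len + 1):
        for history in product(BASE_STATES, repeat=n):
            aug = initial_augmented(history[0])
            for nxt in history[1:]:
                aug = step_augmented(aug, nxt)
            if augmented_reward(aug) != history_reward(history):
                return False
    return True
```

In this sketch a single extra bit is enough history for this one reward rule; choosing that bit by hand is exactly the "hand-crafting" step that the abstract says the paper's construction automates.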
Similar resources
A Modest, but Semantically Well Founded, Inheritance Reasoner
A modest exception-allowing inheritance reasoner is presented. The reasoner allows restricted but semantically well-founded defeasible property inheritance. Furthermore, it gives a well-defined and easily understood semantic interpretation to all of the assertions
GAC on Conjunctions of Constraints
Applying GAC on conjunctions of constraints can lead to more powerful pruning [1]. We show that there exists a simple heuristic for deciding which constraints might be useful to conjoin. The result is a useful automatic way of improving a CSP model for GAC solving.
Extending the Knowledge-Based Approach to Planning with Incomplete Information and Sensing
In (Petrick and Bacchus 2002), a “knowledge-level” approach to planning under incomplete knowledge and sensing was presented. In comparison with alternate approaches based on representing sets of possible worlds, this higher-level representation is richer, but the inferences it supports are weaker. Nevertheless, because of its richer representation, it is able to solve problems that cannot be s...
Reasoning about Noisy Sensors and Effectors in the Situation Calculus
Agents interacting with an incompletely known world need to be able to reason about the effects of their actions, and to gain further information about that world they need to use sensors of some sort. Unfortunately, both the effects of actions and the information returned from sensors are subject to error. To cope with such uncertainties, the agent can maintain probabilistic beliefs about the state ...
Representing and reasoning with probabilistic knowledge - a logical approach to probabilities
Now, we come to offer you the right catalogues of book to open. representing and reasoning with probabilistic knowledge a logical approach to probabilities is one of the literary work in this world in suitable to be reading material. That's not only this book gives reference, but also it will show you the amazing benefits of reading a book. Developing your countless minds is needed; moreover yo...
Publication date: 1999